AN IMPLICITIZATION CHALLENGE 
FOR BINARY FACTOR ANALYSIS 

MARIA ANGELICA CUETO, ENRIQUE A. TOBIS, AND JOSEPHINE YU 

Abstract. We use tropical geometry to compute the multidegree and Newton poljrtope 
of the hypersurface of a statistical model with two hidden and four observed binary 
random variables, solving an open question stated by Drton, Sturmfels and SuUivant in 
[S] Problem 7.7]. The model is obtained from the undirected graphical model of the 
complete bipartite graph K2,4 by marginalizing two of the six binary random variables. 
We present algorithms for computing the Newton polytope of its defining equation by 
parallel walks along the polytope and its normal fan. In this way we compute vertices of 
the polytope. Finally, we also compute and certify its facets by studying tangent cones 
of the polytope at the symmetry classes vertices. The Newton polytope has 17214 912 
vertices in 44 938 symmetry classes and 70 646 facets in 246 symmetry classes. 



1. Introduction 

In recent years, a fruitful interaction between (computational) algebraic geometry and 
statistics has emerged, under the form of algebraic statistics. The main objects studied 
by this field are probability distributions that can be described by means of polynomial 
or even rational maps. Among them, an important source of examples are the so called 
graphical models. In this paper, we focus our attention on a special model: the undirected 
{4:, 2) -binary factor analysis model 

First, let us describe our main player. Consider the complete undirected bipartite graph 
^2,4 with four observed nodes Xi, X2, X^, X4 and two hidden nodes Hi,H2 (cf. Figure [T]). 
Each node represents a binary random variable and each edge represents a dependency 
between two random variables. In other words, if there is no edge between two random 
variables, then they are conditionally independent given the rest of the variables. We 
obtain a hidden model from this undirected graphical model by marginalizing over Hi and 
H2. This model is the discrete undirected version of the factor analysis model discussed 
in [6l Section 4.2]. The model and its immediate generalization J-'m,n is closely related 
to the statistical model describing the behavior of restricted Boltzmann machines [16j, 
which are widely discussed in the Machine Learning literature. Here, J^rn,n is the binary 
undirected graphical model with n hidden variables and m observed variables encoded 
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in the complete bipartite graph Km,n- The main invariant of interest in these models is 
the expected dimension, and, furthermore, lower bounds on n such that the probability 
distributions are a dense subset of the probability simplex A2»"-i. By direct computation, 
it is easy to show that 7^2,2 and ^"3^2 are dense subsets of the corresponding probability 
simplices, so J-4^2 is the first interesting example worth studying. Understanding the model 
J-4^2 can pave the way for the study of restricted Boltzmann machines in general [2]. 



Hi H2 




Xi X2 X3 X4 



Figure 1. The model Each node represents a binary random variable. 

The set of all possible joint probability distributions {Xi, X2, X^, X/j^) that arise in this 
way forms a semialgebraic variety Ai in the probability simplex A15. To simplify our 
construction, we disregard the inequalities defining the model and we extend our param- 
eterization to the entire affine space C^^. In other words, we consider the Zariski closure 
of the joint probability distributions in C^^. As a result of this, we obtain an algebraic 
subvariety of C^^ which carries the core information of our model. In turn, we projectivize 
the model by considering its associated projective variety. This variety is expected to have 
codimension one and be defined by a homogeneous polynomial in 16 variables. 

Problem. (An Implicitization Challenge, [6, Ch. VI, Problem 7.7]) Find the degree and 
the defining polynomial of the model M. 



Our main results state that the variety is a hypersurface of degree 110 in (Theorem 



4.2) and explicitly enumerate all vertices and facets of the polytope (Theorem |4.l| ). Our 
methods are based on tropical geometry. Since the polynomial is multihomogeneous, we 
get its multidegree from just one vertex. Interpolation techniques will allow us to compute 
the corresponding irreducible homogeneous polynomial in 16 variables, using the lattice 
points in the Newton polytope. However, this polytope will turn out to be too big for 
interpolation to be practically feasible. 

The paper is organized as follows. In Section [2] we describe the parametric form of our 
model and we express our variety as the Hadamard square of the first secant of the Segre 
embedding In Section [s] we present the tropical interpretation 

of our variety. By means of the nice interplay between the construction described in Sec- 
tion [2] and its tropicalization, we compute this tropical variety as a collection of cones with 
multiplicities. We should remark that we do not obtain a fan structure, but, nonetheless, 
our characterization is sufficient to fulfill the goal of the paper. The key ingredient is the 
computation of multiplicities by the so called push- forward formula [23t. Theorem 3.12] 
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which we generaUze to match our setting (Theorem 3.4). We finish Section |3] by describing 
the effective computation of the tropical variety and discussing some of the underlying 
combinatorics. 

In Section [4] we compute the multidegree of our model with respect to a natural 5- 
dimensional grading, which comes from the tropical picture in Section[3| Once this question 
is answered, we shift gears and move to the study of the Newton polytope of our variety. 
We present two algorithms that compute vertices of this polytope by "shooting rays" 
(Algorithm [T]) and "walking" from vertex to vertex in the Newton polytope (Algorithm [2]) . 
Using these methods, and also taking advantage of the B4 symmetry of the polynomial and 
the Newton polytope, we compute all 17 214 912 vertices our polytope (in P44 938 orbits 
under which shows the intrinsic difficulties of this "challenging" problem. Along the 
way, we also compute the tangent cones at each symmetry class of vertices and certify the 
facet normal directions by looking at the local behavior of the tropical variety around these 
vectors (after certifying they belong to the tropical variety). In particular, by computing 
dimensions of a certain linear space (Algorithm |3]) we can check if the vector is a ray of the 
tropical variety. In this way, we certify all 246 facets of the polytope modulo symmetry. 
We believe these methods will pave the way to attack combinatorial questions about high 
dimensional polytopes with symmetry as the one analyzed in this paper. 



2. Geometry of the model 

We start this section by describing the parametric representation of the model we wish 
to study. Recall that all our six random variables are binary, with four observed nodes and 
two hidden ones. Since the model comes from an undirected graph (see [6l HH]), we can 
parameterize it by a map p: M^^ — t- M^^, where 
1 1 

Pijki = X] X] "-sibsjCskdsierifrjgrkhri for ah {i, j, k, I) E {0, 1}^. 

s=0 r=0 

Notice that our coordinates are homogeneous of degree 1 in the subset of variables cor- 
responding to each edge of the graph. Therefore, there is a natural interpretation of this 
model in projective space. On the other hand, by the distributive law we can write down 
each coordinate as a product of two points in the model corresponding to the 4-claw tree, 
which is the first secant variety of the Segre embedding P-*^ x x P-^ x P-*^ ^ P-^^ (|2]), i-e. 

1 1 

p: (Pi xpi)8 ^Pi5 p..^; = {J2as^bsjCskdsl){J2^rifrJ9rkKl) ^{i,j,k,l) E {0,1}^ 

s=0 r=0 

From this observation it is natural to consider the Hadamard product of projective varieties: 

Definition 2.1. Let X,Y G P"~i be two projective varieties. The Hadamard product of 
X and Y is 

X.Y = {(xoyo : • • • : x„_i2/„_i) | x E X, y E x . y / 0} C P""\ 
where x.y = (xoyo, • • • , Xn^iVn-i) G C". 
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Note that this structure is well-defined since each coordinate is bihomogeneous of degree 
(1,1). The next proposition follows from the construction. 

Proposition 2.2. The algebraic variety of the model is Ai = X . X where X is the first 
secant variety of the Segre embedding 



Notice that the binary nature of our random variables enables us to define a natural 
S2-action by permuting the values and 1 on each index in our 4-tuples. Combining this 
with the S4-action on the 4-tuples of indices, we see that our model comes equipped with a 
natural §4 X (S2)^-action. In other words, the 16 coordinates pjjfc^ of P"*^^, for A;, / G {0,1}, 
are in natural bijection with the vertices of a 4-dimensional cube. Assuming is a 
hypersurface (as we will prove in Section [s]), its defining polynomial is invariant under the 
group i?4 of symmetries of the 4-cube, which has order 384. This group action will be 
extremely helpful for our computations in the next two sections. 

We now describe the ideal associated to the secant variety Sec(P"'^ x P^ x P^ x P-*^). 



The Segre embedding P-^ x P x P^ x P P^ has a monomial parameterization Pijki = 
Ui ■ Vj ■ Wk- xi for i,j, k, I £ {0, 1}. Its defining prime ideal is generated by the 2 x 2-minors 
of all three 4 x 4-flattenings, together with some 2 x 2-minors of the 2 x 8-flattenings [9l 
Section 3]: 



'(12|34) : 



/ POOOO POOOl POOlO POOll \ 

Poloo Poioi Poiio Pom 

Piooo Piool Piolo Pom 

Vpiioo Piioi Piiio Pun / 



F, 



(13|24) 



/ PODOO POOOl POlOO POlOl \ 

Poolo Pooii Poiio Pom 

PlOOO PlOOl PllOO PllOl 

Vpioio Pioii Piiio Pun / 



F 



(14|23) - 



/ POOOO POOlO POlOO Poiio \ 

Poool Pooii Poioi Pom 
Piooo Piolo Piioo Pino 
Vpiooi Pioii Pnoi Pnn / 
In turn, the defining ideal of the first secant variety of the Segre embedding can be computed 
from the previous three 4 x 4- flattening matrices. We state the result for the case of the 
variety we are studying, although the set-theoretic result is also true for an arbitrary 
number of observed nodes. 

Theorem 2.3 ([3 US]). The secant variety X = S'ec(P^ x P^ x P^ x P^) C P^^ is the 
nine- dimensional irreducible subvariety consisting of all 2 x 2 x 2 x 2-tensors of tensor 
rank at most 2. The "prime ideal of X is generated by all the 3 x 3-minors of the three 
flattenings. 



3. Tropicalizing the model 

In this section we define tropicalizations of varieties in C" and compute the tropicaliza- 
tion of M. See |ll[20] for more details about tropical varieties. 

Definition 3.1. For an algebraic variety X C C" not contained in a coordinate hyperplane 
and with defining ideal I = I{X) C K[xi, . . . the tropicalization of X or / is defined 
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as: 

T{X) = T{I) = {w G M" I in^(/) contains no monomial}, 

where in^(/) = {mw(f) : f £ I), and in^(/) is the sum of all nonzero terms of / = CaX°' 
such that a ■ w is maximum. 

Alternatively, when working with subvarieties of tori V C (C*)" we consider the defining 
ideal I over the ring of Laurent polynomials and set 

T{V) = T{I) = {w; G M" I in^(/) / (1)}. 

Both definitions agree if we consider X to be the Zariski closure of V in C" . We would go 
back and forth between these two definitions. 

The tropical variety T{I) is a polyhedral subfan of the Grobner fan of /. If / is a 
prime ideal containing no monomials, then 'T{I) is pure of the same dimension as X and 
is connected in codimension one The set {w G T{I) : in^(/) = J} is a linear space in 
M" and is called the lineality space of the fan T{I) or the homogeneity space of the ideal 
/. This space can be spanned by integer vectors, which form a primitive lattice A. This 
lattice encodes the action of a maximal torus on X, given by a diagonal action. All cones 
in T{I) contain this linear space. 

In addition to their polyhedral structure, tropical varieties are equipped with integer 
positive weights on all of their maximal cones. We now explain how these numbers can be 
constructed. A point w G T{I) is called regular if T{I) is a linear space locally near w. The 
multiplicity of a regular point w is the sum of multiplicities of all minimal associated 
primes of the initial ideal in^(/). See [H Section 3.6] for definitions. The multiplicity of a 
maximal cone a C T(/) is defined to be equal to for any w £ a in its relative interior. 
It can be showed that this assignment does not depend on the choice of w. With these 
multiplicities, the tropical variety satisfies the balancing condition [22] . 



As we discussed in the previous section (Proposition 2.2) our variety is expressed as a 
Hadamard power of a well-known variety. This Hadamard square has a dense set which can 
be parameterized in terms of a monomial map (the coordinatewise product of two points). 
The integer matrix of exponents corresponding to this monomial map is {In \ In) C Z"^^". 
Although tropicalization is not functorial in general, it has nice properties if we restrict it 
to monomial maps between subvarieties of tori. 

We now describe the tropicalization of monomial maps. Let Ahe a dx r integer matrix 
defining a monomial map a: {C*Y — )• (C*)'^ and a linear map A: M*" — J- R"^ defined by left 
multiplication by this matrix. 

Theorem 3.2. [22l [2l] Let V C {C*Y be a subvariety. Then 

T{a{V)) = A{T{V)). 

Moreover, if a induces a generically finite morphism of degree 5 on V, then the multiplicity 
of T{a{V)) at a regular point w is 

rriu) = \ - y^m^ ■ index (L^, fl Z'^ : ^(L^ n Z'")), 

V 
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where the sum is over all points v £ T{V) with Av = w. We also assume that the 
number of such v is finite, all of them are regular in T{V), and L„,L^ are linear spans of 
neighborhoods of v €z T{V) and w S ATiV) respectively. 

At first sight, the hypothesis of this theorem is not satisfied by our variety because 
the map ct^xxx ^'^^ generically finite. However it is very close to having this finiteness 
behavior. Namely, after taking the quotient X' of X by a maximal torus action, and a 
choice of a suitable monomial map a, the map <5|^,^^, becomes generically finite and we 



can apply Theorem 3.2 We now explain this reduction process. 

Let V C (C*)'' a subvariety, a: {C*Y — (C*)'^ a monomial map, and let W = a{V). 
Consider the lineality space M (g)z A C ^(F), and let A' = A{A). We identify M ®z A with 
a Z-basis of A = (M 0z A) HZ''. Notice that A' need not be a primitive lattice in Z'^ in 
general. Cah (A')'''^* its saturation in Z'^, that is (A')""^* = (M (g)^ A') n Z'^. We know by 



construction and Theorem 3.2 that M (8>z A' is contained in the lineality space of T{W). 
Therefore, we can consider the linear map between these tropical varieties after moding 
out by M (8>z A and M <Siz A' respectively. As we mentioned earlier, the lineality space of 
each tropical variety determines the maximal torus action. For example, (C*)^ acts on V 
by t ■ {xi, . . . ,Xn) '■= {t'^'^xi, . . . jf^^Xr) where a lies in A. 

The linear map A sends A onto A', inside the lineality space of T{a(y)). In addition, 
the monomial map a is compatible with the torus actions on V and a{V). In particular, 
the equality a(A 0^ C*) = A' (g)^ C* induces an action on 1^ by a subtorus (the one 
corresponding to the primitive lattice (A')*^*^*). Thus, we can take the quotient of V and 
W by the corresponding actions of tori H and H' . We obtain the commutative diagram: 

(3.1) V 



V' = V/H W/H' = W. 

Here, H = A^zC* ^ (c*)dimA ^^^^ ^/ ^ ^/ ^* ^ ^^*-^dimA'_ ^^^^^ ^ -g primitive 
sublattice of Z*", it admits a primitive complement in Z^. Fix one of them and call it A"*". 
Note that this complement need not be the usual orthogonal complement. 

Assume for simplicity that A' is a primitive sublattice of Z*^. Therefore, we can identify 
a with the monomial map corresponding to the linear map: 

A': (MoZ'~)/(MO A) = M A^ =:(M(g) A)-^^ (M0Z7(M0 A') =M0 A'^=: (M(g) A')"^. 

Since A is primitive, (M (g A)-*" n Z*" = A"*", and likewise for A'-*". 

To simplify notation, call L := M iS" A and L' := M iS" A'. From the construction it is easy 
to see that T{V') = T{V)/L and T{W') = T{W)/L' as sets. But in fact, they agree as 
weighted balanced polyhedral fans. More precisely. 

Lemma 3.3. Let X C (C*)*" and let L be a subspace of the lineality space of the tropical 
variety T{X) generated by integer vectors. Then T{X) / L is a balanced weighted polyhedral 
fan where the multiplicities at regular points w' are defined as m^i = mw for any w in 
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the fiber of w' under the projection map. With these weights, T{X) / L coincides with the 
tropical variety T{X'), where X' is the quotient of X by the torus (LnZ^')(X'2C* = (C*)'^™^, 
which is a subtorus of the maximal torus acting on X . 

Proof. By definition, we know that ini„+i(/) = in^(I) for any w E W. Let I := dimL. Call 
A := L n the underlying lattice of L. Since A is a primitive lattice, we can extend any 
Z-basis of A to a Z-basis of Z'^. Thus, after a linear change of coordinates (i.e. a monomial 
change of coordinates given by this new Z-basis of Z'') we can assume A = Z(ei, . . . ,ei). 
And in this case, we can pick the direct summand A"*" of A to be Z(e;+i, . . . , e^). In 
particular, the projection map ir: X ^ X' = X/H corresponds to the monomial map 
a: {C*Y — )• (C*)'""' determined by the integer matrix A E ^'"^('""0^ whose columns are a 
Z-basis of A"*-. 

By construction, / = I{X) C C[x]'^^, . . . , xf^] is homogeneous with respect to the grading 
deg(xj) = Cj for i < / and deg(xj) = for j > I. Since any homogeneous Laurent polyno- 
mial is of the form / = x"g{xi^i, . . . , Xr), we see that / is generated by Laurent polynomials 
in the variables {x/4.1, . . . Xr-}. Call gi,...,gs these generators. Therefore I' = I{X') = 
{gi{xi+i, . . .,Xr), ■ ..gs{xi+i, . . .,Xr)) C C[x^\, . . .,xf^] and / = I'C[xf^ -=^^1 



r 



From Theorem 3.2 we know that TiX') = AT{X) = T{X)/L as sets. Moreover, since 
the subspace L lies in all cones of TiX), then the set T{X') which is the quotient of T{X) 
by L has a natural fan structure inherited from the one of T{X). By definition, if w' 
is a regular point in T{X') then any lifting point va. w + L would be a regular point in 
T{X). Moreover, = in^/(I')C[xf . . . , xf^]. In particular, a primary decomposition 

'm.yji{I') determines a primary decomposition of 'm.w{I) by extending each ideal to the whole 
Laurent polynomial ring in n variables. Therefore, to show m^i = it suffices to show 
that the multiplicity of any minimal prime P C C[x^\, . . . , x^^] of in^„/(/') equals the 
multiplicity of P C C[xf^,...,xf^ in in^(/). This claim follows from the definition of 
multiplicity. More precisely: 

^P- A- A- Sp[xf}...,xf] 

m(-P, m^/(J )) = dim Sp -— = dim^ Sp ^,^±1 ^±1 



^ 5pin^,(,,) i^)l.n..,.t^] Sp[xf}. . .,xf]in^,{r) 

A- S[xt}...,xf]p 

where S = C [xf_^-^ , ■ ■ ■ , x^^ . □ 



Using the previous construction, we extend Theorem 3.2 to the case of monomial maps 
that are generically finite after taking quotients by appropriate tori. This extension fits 
perfectly into our setting. 

Theorem 3.4. Let a: {C*Y — )• (C*)'^ be a monomial map with associated integer matrix 
A and let V C (C*)*" be a closed subvariety. Then, 



T{a{V)) = A{T{V)). 
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Suppose V has a torus action given by a rank I lattice A C Z*". Let V' be the quotient 
by this torus action. Let a: V — )• (C*)'^/a(A 0z C*) be the induced monomial map, with 
associated integer matrix A' . 

Suppose A' = ^(A) is a primitive sublattice of Z'^ and that a induces a generically finite 
morphism of degree 5 on V' . Then the multiplicity of T{a{V)) at a regular point w can be 
computed as: 

(3.2) m^ = \ ■ m^- index (L„ n Z"* : A(L^ n Z*")), 

A-v=w 

where the sum is over any set of representatives of points {v' = tt{v) G T{V') \ A'v' = w'} 
given w' = -7t{w) G M'^/(]R 0z A') = M <8)z A'-*". We also assume that the number of such v' 
is finite, all of them are regular in T{V') and L„,L«, are linear spans of neighborhoods of 
V € T{V) and w £ ATiV) respectively. 

Remark 3.5. In case A' is not a primitive lattice, the formula for mm will involve an extra 
factor, namely, the index of A' with respect to its saturation A"''^* in Z'^. In this case, A'^ 
will correspond to any complement of the primitive lattice A"^^* inside Z"^. 



Proof of Theorem 3.4[ The equality as sets follows from Theorem 3.2 To prove the 



formula for multiplicities, we first note that the sum in (3.2) is finite. This follows because 
a induces a generically finite morphism if and only if ker^' n T{V') = {0} if and only if 
A(A^)nA' = {0}. 



From the diagram (3.1) and the surjectivity of a and a, we know that the multiplicity 
formula holds for 1~{Y ) and the morphism a. Pick w' a regular point of 1~{X') and pick 
any point w in the fiber tt~^{w') = if + (M (E> A'). By definition, w is a regular point of 
T(X) and we have = m^' by Lemma [3. 3[ We assume all v' in the fiber of A' at w' are 
regular in T^{V') and L7r(^),L^(^) are linear spans of neighborhoods of tt{v) £ TiV') and 
7r(w) G A'T{V') respectively. 

By construction, the index set in the formula for m^i agrees with the index set in 
formula (|3.2l) for m^. Therefore, our goal would be to show that each summand indexed 



by tt{v) in the formula for rriy^i equals its corresponding summand in formula (3.2) for m. 



We know that m„ = m7r(t,) by Lemma 3.3 Therefore, we only need to prove that the lattice 



indices on each summand are the same, i.e 

(3.3) index (L^ n Z'^ : A{L^ D If)) = index (L^(^) n (A'-^) : A'(L^(^) n A"^)). 

Note that by construction. A' C L^^n Z*^, A C L^,, and likewise A{K) = A' C AiJ^^fMT'). 
Hence, we can consider the quotient of n Z"^ and Ail^^ n T/') by A'. We obtain 

L^nz"^ ^ (L^nz'^)/A' 
yl(U nz*^) ~ ^(u nZ'-)/A'' 

The equality in ( |3.3| ) follows by the identifications (L^ n Z^)/A' = I^t^i^^) n (A'-*-) and 
A(L„ n Z")/A' = ^'(L^(^) n A-L), via projecting to A'-^. □ 
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Theorem 3.6. Given X,Y C C two irreducible varieties, consider the associated variety 
X xY C C^^. Then 

T{X xY)= T{X) X T{Y) 

as weighted polyhedral complexes, with rriaxr = rnamr for maximal cones a C T{X),t C 
r(y), andaxT C T{X x Y). 

Proof. The equahty as polyhedral complexes is a direct consequence of the equality in(„ (/+ 
J) = 'm.ul + 'va.vJ , which follows by Buchberger's criterion and the fact that the generators 
of / and J involve disjoint sets of variables. If we pick u G TX, v G TY regular points, 
then {u, v) is a regular point in T{X x Y). Our goal is to prove the multiplicity formula. 

Given two primary decompositions in„(/) = Cl^Mi C C[x], in^(J) = Clj C C[y], 
we claim that in(„,„)(/ + J) = f]^j{Mi + Nj) C C[x, y] is also a primary decomposition. 
The equality as sets follows immediately, so we only need to show that Mi -\- Nj C C [x, y] 
is a primary ideal. Let Pi C C[x] and Qj C C[y] be associate prime ideals to Mi and 
Nj respectively. Since C is algebraically closed, and Mj and Nj involved disjoint sets of 
variables, it is immediate to check that Pi + Qj C C[x, y] is a prime ideal. Namely, the 
quotient ring C[x,y]/{Pi + Qj) equals (C[x]/Pj)[y] (g)c iC[y]/Qj)[x}i a tensor product of two 
domains over C, hence also a domain. 

Moreover, since both Mj and A'^- involve disjoint sets of variables, we have 

Ann(Mi + Nj) = Ann Mi 0c C[y] + C[x] ®c Ann Nj. 

From this and the fact that P^^ C AnnMj C Pi and Q^j C AnnA'j- C Qj for suitable 
Si, tj G N, we conclude (Pi + QjY'^^^ C Ann(Mi + iV,) C Pi + Qj thus proving by definition 
that Mi + Nj is a (Pj + (5j)-primary ideal. 

With similar arguments we conclude that all minimal primes of in(„^^) (/ + J) are sums 
of minimal primes of in„(/) and in„(J). This follows because, given P,P' C C[x] and 
Q,Q' C C[y] prime ideals, it is straightforward to check that P + Q C P' + Q' if and only 
if P C P' and Q C Q' . 

Let cr, T be maximal cones on T{X) and T{Y), and let u,v be regular points in a and r 
respectively. By definition of multiplicity of a maximal cone, we have 

= m(P,C[x]/in„/) = ^ dim(c[a;]/p)p (C[x]/in„/)p ; 

PeAss{in„(/)) PeAss(in„(7)) 
P minimal P minimal 



=J]] dim(c[y]/Q)Q(C[y]/in„ J)q 



= y~^ dim 



C[x, y] 



QeAss(in„(J)) 
Q minimal 



PeAss(in„ {/))'' 
QeAss(in„(J)) 
P,Q minimal 



Clx,y] 



)p+Q iiiul + in^ J ^+'3 



The statement mcr> 



rriamr follows from the distributive law and Lemma 3.7 



□ 
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Lemma 3.7. Let I C C[x}, J C C[y] be ideals and let P C C[x}, Q C C[y] be minimal 
primes containing I and J respectively. Then 

/C[x, y]\ 

dim(c[5,y]/P+Q)p+Q J = dim(c[x]/P)p(C[x]//)p • dim(c[y]/Q)Q (C[y]/ J)q. 

Proof. Consider the residue fields F = {C[x]/P)p, G = (C[y]/Q)Q, and L = {C[x,y\/{P + 
Q))p+Q. Note that F^^G ^ L via the natural inclusion given by the multiplication map, 
since C is algebraically closed. Likewise, one can easily show that C[j;]// C[y]/J = 
C[x,y]/{I + J) via the multiplication map. We wish to find a similar result for the local- 
ization of these quotients at the corresponding minimal primes. 

For simphcity, cah M = (C[x]/I)p ^ F' and N = {C[y]/J)Q ^ C the corresponding 
finite dimensional vector spaces. Our goal is to prove that M (^c is a free L- vector space 
of rank sr. From the canonical isomorphisms C[x]p 0c[x] C[x]// = (C[x]//)p, C[y]Q ®c[y] 
C[y]/J ^ (C[x]/J)q, we see that M(S)cN = {C[x]/ I)p(S)ciC[y]/ J)q ^ C[x,2/]/(/+ J)[5-i], 
where S = {C[x] \ P){C[y] \ Q) is the multiplicatively closed set consisting of products of 
polynomials, each of which is pure in each set of variables, and which do not lie inside the 
prime ideals P or Q. Similarly, F (S)c G ^ C[x,y]/{P + 

On the other hand, notice that M (S>c N comes with a natural F (^c G-module structure 
via "coordinatewise action." Hence, 

(C[x, y]/{I + J))(P+Q) = L 0^F^^G) {M 0c N). 

From the last isomorphism we see that to prove our lemma it suffices to show that M f^c N 
is a free F (^c G- module of rank sr. The original claim will follow after tensoring with L. 

Let {fi}, {gj} be bases of M and respectively. We claim that {fi gj} is a basis of 
M<Sic^ as an FiSicG-module. It suffices to check the linear independence. We proceed in an 
elementary way, by successively using the linear independence of the different bases of the 
free modules M, N, F and G. Suppose ^ aijfi gj = £ M (gic -^j with aij G F (gic G. 
Write aij = Ylki'^ijkiUk ® vi where aijki G C and Uk,vi are basis elements of the field 
extensions F\C, G|C respectively. Thus, 

(3.4) = X] ® 9j = ^{^ aijkiUkfi) ®c (vigj). 

i,j j,l i,k 

To prove aij = it suffices to show aijki = for all i,j, k, I. By a well-know result on tensor 
algebras (cf. [8; Lemma 6.4]), expression (3.4) implies the existence of elements aju G C, 
ht £ M such that ajith = ^^j. aijkiUkfi for all j, I and <^jitVigj = for all t. Hence, 
rearranging the sum we conclude that ^j(^iO,jitVi)gj = in for all t, which implies 
X^z ^jitvi = G G for all j, t. This in turn implies aju = for all j, /, t. 

Using the condition Yji{YlkO'ijkiUk)fi = ^tO-juht = 0, we have Yjk'^ijki'^k = for all 
i, j, /. Therefore, a^jki = for all i, j. A:, /, as we wanted to show. □ 

Corollary 3.8. Given X,Y C two projective irreducible varieties none of which is 
contained in a proper coordinate hyperplane, we can consider the associated irreducible 
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projective variety X .Y CF"". Then as sets; 

T{X.Y) = TiX) + T{Y), 
where the sum on the right-hand side denotes the Minkowski sum in M"+^ . 

As one can easily imagine, this set-theoretic result is motivated by (and is a direct 
consequence of) Kapranov's theorem [71 Theorem 2.2.5] (i.e., the fundamental theorem of 
tropical geometry) and the fact that valuations turn products into sums (see \22\ Theorem 
2.3] for the precise statement). The novelty of our approach is that under suitable finiteness 
condition of the monomial map defining Hadamard products, we can effectively compute 
multiplicities of regular points in T{X . Y) from multiplicities of T{X) and T{Y). It is 
important to mention that this finiteness condition holds for the example we are studying 
in this paper. Moreover, we are not claiming that T{X . Y) inherits a fan structure from 
T{X) and T{Y). In general, it might happen that maximal cones in the Minkowski sum 
get subdivided to give maximal cones in T{X . Y) or, moreover, the union of several cones 
in the Minkowski sum gives a maximal cone in T{X . Y). 

Example 3.9. It may seem surprising at first that the combinatorial structure (e.g. /- 
vector) of the Newton polytope does not follow easily from the description of the tropical 
hypersurface as a Minkowski sum of two fans. Moreover, the number of edges of the 
polytope (and even the number of vertices) may exceed the number of maximal cones of 
the tropical hypersurface given as a set. To see this in a small example, consider the 
tropical curve in M'^ whose six rays are columns of the following matrix 

1 1 1 1 1 -5 \ 
112-4, 
10 2 1-4/ 

and consider the Minkowski sum of the fan with itself. This tropical hypersurface is 
described as a union of 15 cones (or as a non-planar graph in with 6 nodes and 15 edges), 
but the dual Newton polytope has 16 vertices, 25 edges, and 11 facets. If we intersect the 
tropical hypersurface with a sphere around the origin, we would see the planar graph in 
Figure [2} 




Figure 2. A tropical surface in M.^ described as a collection of 2- 
dimensional cones in or as a non-planar graph in 
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The planer regions correspond to the 16 vertices. The black dots correspond to the 
columns in the above matrix and the arcs between them correspond to cones generated by 
them. The nodes in the graph correspond to the facets in the Newton polytope. Six of 
these facets correspond to the black dots in Figure [2] and are the 6 nodes in the non-planar 
graph description of the tropical hypersurface. The remaining five facets correspond to 
the missing intersection points between the edges of the non-planar graph in the picture. 
Adding these 5 nodes to the graph will give us a planar graph with 11 nodes and 25 edges 
that encodes the fan structure of the tropical variety and the combinatorics of the Newton 
polytope. 

If we had started instead with a tropical curve whose six rays are ±ej for i = 1,2,3, 
then the dual polytope would be a cube with /-vector (8, 12, 6). □ 



Due to the lack of a fan structure in our description oi X . Y, Corollary 3.8 gives 
no estimate for the number of maximal cones in the tropical variety X .Y, where the 
fan structure is inherited from the Grobner fan structure of the defining ideal of X .Y. 
Moreover, this fan structure is infeasible to obtain in general. Hence, in the hypersurface 
case we have no estimate on the number of edges of the dual polytope to the tropical variety 
T{X . Y) and, as a consequence, no estimate on the number of vertices of the polytope. 
As the previous example illustrates, the description of T{X .Y) as a collection of weighted 
cones of maximal dimension contains less combinatorial information than the fan structure 
does and hence, the computation of the dual polytope becomes more challenging, as we 
show in Section m 

We now describe the computation of the tropical variety T(A^) of our model Ai. By our 
discussions in Section 2l we know that the defining ideal of X = Sec{F^ xP^ xP^ xP^) C P^^ 
is generated by the 3x3 minors of the three fiattenings of 2x2x2x2 matrix of variables 
(Pijki), for a total of 48 generators. Since X is irreducible, we can use gf an [1S\ to compute 
the tropical variety T{X). 

The ideal I{X) of C[poooOi • • • iPiiii] is invariant under the action of i?4, and gf an can 
exploit the symmetry of a variety determined by an action of a subgroup of the symmetric 
group §16- For this, we need to provide a set of generators as part of the input data. The 
output groups cones together according to their orbits. 

The tropical variety T{X) G M}^ has a lineality space spanned by the rows of the 
following integer matrix: 

/lllllllllllllll 



(3.5) A 



000000001111111 
000011110000111 
,001100110011001 

Voioioioioioiolo 



where the columns correspond to variables Pijki, for i,j,k,l £ {0,1}, ordered lexico- 
graphically. As we explained already in this section, we can identify this linear space 
with the maximal torus acting on the variety X and hence on X . X. A set of gen- 
erators of the corresponding lattice giving this action can be read-off from the param- 
eterization. More precisely, consider the morphism of tori /3: (C*)^ — )• (C*)^^ sending 
(io,...,t4) I—)- . . . , i*"^^), where each rrii is of the form {l,Vi), where Vi runs over 
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all sixteen vertices of the 4-cube. Then, one can check that the closure of the image of 
j3 in C^^ is the affine cone over the Segre embedding P-"^ x P-"^ x P-"^ x P-*^ ^ P-*^^. More 
precisely, given a generic point in the image of /3, we have tot\t2t^t''^ = XxiUjZkWi, where 
(xo : xi) = (1 : ti), (yo : Vi) = (1 : ^2), {zq : zi) = (1 : is), {wq : wi) = (1 : t^) £ P^ and 
A = to e M. 

The gfan computation confirms that the tropical variety T{X) inside M}^ is a 10- 
dimensional polyhedral fan with a 5-dimensional lineality space. After moding out by 
the lineality space, the /-vector is: 

(382, 3436, 11236, 15640, 7680). 

Regarding the orbit structure, there are 13 rays and 49 maximal cones in T{X) up to 
symmetry and all maximal cones have multiplicity 1. 

According to Corollary 3.8, the tropical variety of the model is (as a set) 

T{M) = T{X . X) = T{X) + T{X). 

Since we know that this will result in a pure polyhedral fan, we only need to compute all 
Minkowski sums between pairs of cones of maximal dimension. For this step we use the 
i?4 group action. There is a natural (coordinatewise) action of i?4 x on 1'{X) x T{X) 
that translates to a i?4-action on 'T{X) + 7~(X). Therefore, to compute the Minkowski 
sum of maximal cones, we first consider 49-7680= 376320 pairs (o"i,o"2), where ai is taken 
from a set of representatives of the 49 orbits of maximal cones, and a"2 is taken from 
the set of all maximal cones. We discard the pairs (o"i,(T2) for which ui + a2 is not of 
maximal dimension 15. After this reduction, the total number of maximal cones computed 
is 92 469. By construction, this list of 92 469 cones contains all representatives of the orbits 
of maximal cones in T{X . X). But they do not form distinct orbits. Some cones appear 
twice in the list as o" -|- r and t + a, and this the only possibility except for 4 512 cones 
which arise from two different pairs, plus their flips. That is, ai + Ti = a2 + T2 where 
both pairs differ only by an interchange of a single pair of extremal rays (ri, r2) G (cri, ri): 
i.e. (T2 = (o"i \ {ri}) U {r2} and T2 = (ri \ {^2}) U {ri}. Some cones a have non-trivial 
stabilizers in B4, so there are cones a + ti and a + T2 in the same orbit. The dimension of 
the maximal cones in T{A4) confirms that is a hypersurface. 

The total number of orbits of maximal cones is 18 972, and each orbit has size 96, 192, 
or 384. We then let the group act on each orbit and obtain 6 865 824 cones of dimension 



15, the union of which is the tropical variety T{A4), as predicted by Corollary 3.8 We 



do not have a fan structure of T{^A). Nonetheless, we can compute the multiplicity 
of T{M) at any regular point using Theorem 3.4 because our matrix A is of the form 
{I16 I Iw) £ Z^^^^^. After taking quotients by the respective maximal torus acting on 
each space, the map X' x X' ^ X' . X' , is generically finite of degree two. In practice, the 
lattice indices in (3.2) are computed via greatest common divisors (gcd) of maximal minors 
of integer matrices whose rows span the cones in T{X) and T{X x X). More precisely. 

Lemma 3.10. Given a lattice D C Z*', and an integer matrix A C Z*^^^ with rk{A{D)) = 
rk{D), the lattice index index(K 0z ^{D) n Z'^ : A{D)) can be computed as follows. Pick 
{wi, . . . Ws} a minimal system of generators of D over TL, and let B := (wi | . . . | Wg) G 
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^rxs_ Then, the index equals the quotient of the gcd of the maximal minors of the matrix 
A ■ B €z Tl]'^^ by the gcd of the maximal minors of the matrix B. 

Proof. Since M(g)z = M.CdzA{m.CdzD nZ''), the mdex(R (g)^A{D) nZ<^ : A{D)) equals 

the product 

mdex(M A{D) n Z"^ : A{R (g>z D n IT)) ■ index(A(M I? n IT) : A{D)). 

By construction index(]R ^{D) n Z"^ : A{D)) is the gcd of the maximal minors of the 
matrix A ■ B. To prove the result, it suffices to show that index(A(M (gi^ D Ci TIT) : A{D)) 
equals the gcd of the maximal minors of the matrix B in the statement. 

Since y\{A[D)) = rk(L»), this implies that ker A n D = ker A n (R D n IT) = {0}. 
Then: A(M ®i D <r^ IT)/A{D) = (R^zDn 7T)/D, which equals the gcd of the maximal 
minors of the matrix i?, as we wanted to show. □ 

In our case, B is spanned by twenty integer vectors (five from each cone a x 0, x 
r G TX X TX plus the lattices A x 0, x A coming from the lineality space. Call 
and C-r each list of five vectors of a and r. Then, the matrix B in the previous lemma 
equals the block diagonal matrix B = diag ( i?o-, where Bfj = (Co-|A), Br = {Cr\X) 
and A ■ B = (C(j|A|Ct-|A). Thus, the index equal the quotient of gcd(15 x 15-minors of 
(Ccr|C^|A) by the product gcd (lO x 10-minors of {C„\k)) ■ gcd (lO x 10-minors of (C^|A)). 
Each gcd calculation is done via the Hermite (alt. Smith) normal form of these matrices 
|18j . After computing all multiplicities we obtain only values one or two. 

4. Newton polytope of the defining equation 

In this section, we focus our attention on the inverse problem. That is, given the tropical 
fan of an irreducible hypersurface, we wish to computing the Newton polytope of the defining 
equation / = CaxT of the hypersurface, i.e. the convex hull of all vectors a G Z^^ such 
that appears with a nonzero coefficient in /. 

4.1. Vertices and Facets. We will first present the results of our computation before 
discussing algorithms and implementation in the following subsections. Here is the ultimate 
result: 

Theorem 4.1. The Newton polytope of the defining equation of M has 17214 912 vertices 
in 44 938 orbits and 70 646 facets in 246 orbits under the symmetry group B4. 

Among the 44 938 orbits of vertices, 215 have size 192 and 44 723 has size 384. The 
maximum coordinate of a vertex ranges between 14 and 20, and the minimum coordinate 
is either or 1. All but 46 orbits have a zero-coordinate. A vertex can have up to seven 
zero-coordinates. Each vertex is contained in 11 to 62 facets. There are 11800 symmetry 
classes of simple vertices, that is, those contained in exactly 11 facets. The following is 
a representative of the the unique symmetry class of vertices contained in 62 facets each, 
which has size 192: 



(0, 0, 1, 17, 13, 6, 17, 1, 17, 1, 6, 13, 1, 17, 0, 0). 
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We index the coordinates of P^^ by {0, 1}^ and order them lexicographicahy. Since our 
polynomial is (multi)-homogeneous, knowing even a single point in the Newton polytope 
gives the multidegree. We now describe the multidegree of the hypersurface A4: 

Theorem 4.2. The hypersurface Ai has multidegree (110, 55, 55, 55, 55) with respect to the 



grading defined by the matrix in (3.5) 



Now let us look at the 246 orbits of facets. The following table lists the orbit sizes: 



size 2 8 12 


16 


24 


32 


48 


64 


96 


192 


384 


number of facet orbits 12 1 


3 


1 


1 


7 


3 


15 


67 


145 



The coordinates Xijki are naturally indexed by bit strings ijkl G {0, 1}^. The two facet 
inequalities in the size-2 orbit say that the sum of Xijki such that i + j + k + l is even (or 
odd) is at least 32. Each facet contains between 210 and 3 907356 vertices. The unique 
symmetry class of facets containing the most vertices consist of coordinate hyperplanes. 

Using Algorithm [3j we certified that out of the 13 orbits of rays of the 9-dimensional 
tropical variety of the Segre embedding x x P-^ x P-^ P^^, only the following eight 
are facet directions of T{M^): 

(1, 0, 0, 1, 0, 1, 1, 2, 2, 1, 1, 0, 1, 0, 0, 1) 
(1, 3, 3, 1, 3, 1, 1, 3, 1, 3, 3, 1, 3, 1, 1, 3) 
(2, 1, 1, 0, 1, 0, 0, 0, 2, 1, 1, 0, 1, 0, 0, 0) 
(2, 1, 1, 2, 1, 2, 2, 1, 1, 2, 2, 1, 2, 1, 1, 2) 
(3, 2, 2, 1, 2, 1, 1, 0, 2, 1, 1, 0, 1, 0, 0, 0) 

^ «3 J 3 y 3 y «3 J 3 y 3 y 3 y 3 y 1 ^ 3 j 3 y 1. ^ 3 y y 1 ^ 3 ^ 

(-1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0) 

(-1, -1, -1, -1, 0, 0, 0, 0, 0, 0, 0, 0, -1, -1, -1, -1). 

A complete list of vertices and facets, together with the scripts used for computation, 
are available at 



http : //people . math . gatech . edu/~ j yu67/ImpChallenge/ 



4.2. Computing vertices. We now discuss how we obtained the Newton polytope. We 
will first explain the connection between ?"(/) and NP(/). From the tropicalization T{M) 
of the hypersurface Ai = {p : f{p) = 0} C P^^ we want to compute the extreme monomials 
of /. For a vector w G M^^, the initial form in^(/) is a monomial if and only if w is in 
the interior of a maximal cone (chamber) of the normal fan of NP(/). The tropical variety 
of the hypersurface M is the union of codimension one cones of the normal fan of NP(/). 
The multiplicity of a maximal cone in T{M) is the lattice length of the edge of NP(/) 
normal to that cone. 

A construction for the vertices of the Newton polytope NP(/) from its normal fan T{f) 
equipped with multiplicities was developed in [5j (see also [1] for several numerical ex- 
amples). The following is a special case of [3 Theorem 2.2]. Since the operation T{f) 
interprets / as a Laurent polynomial, NP(/) will be determined from T(/) up to transla- 



tion. The algorithm described in Theorem 4.3 computes a representative of NP(/) which 
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lies in the positive orthant and touches aU coordinate hyperplanes, i.e. / is a polynomial 
not divisible by any non-constant monomial. We describe the pseudocode in Algorithm [T} 

Theorem 4.3. Suppose w G M" is a generic vector so that the ray {w — M>o Cj) intersects 
T{f) only at regular points ofT{f), for all i. Let he the vertex of the polytope V = 
NP{f ) that attains the maximum of {w ■ x : x £ V}. Then the i^^ coordinate ofV^ equals 



E 



where the sum is taken over all points v G T(/) H — M>oej), is the multiplicity of v in 
T{f), and l\ is the i^^ coordinate of the primitive integral normal vector F to the maximal 
cone in T{f) containing v. 



Note that we do not need a fan structure on T{f) to use Theorem 4.3 A description of 
T(/) as a set, together with a way to compute the multiplicities at regular points, gives us 
enough information to compute vertices of NP(/) in any generic direction. 

In Sectionlslwe computed T{f) as a union of 6 865 824 cones. For each of those cones, we 



calculated the lattice index in Theorem 3.4 and the primitive vector which is the direction 
of the edge of NP(/) normal to the cone. There are 15 788 distinct edge directions in 
NP(/). We then pick a random vector vu G M^^ and go through the list of 6 865 824 cones, 
recording the cones that meet any of the rays w — M>o Sj. For each i, we sum the numbers 
ruy ■ \l^\ over all the intersection points v and obtain the i^^ coordinate of the vertex. 



Input: The list of maximal cones, with multiplicities, whose union is the 

codimension one cones in the normal fan of a polytope V C M". An objective 
vector w G M"'. 

Assumption: The objective vector w does not lie in any cone in J^, i.e. the face 

is a vertex. For each i = 1,2, ... ,n the ray w — M>oej does not meet the boundary of 

any cone in J^. 

Output: The vertex that maximizes the scalar product with the objective vector 
w. 

P"" ^ 

for each cone a in J- do 
for i = 1, 2, . . . , n do 

if cj n (if - M>oei) / then 

Vf + rua • Ifj^i, where rria is the multiplicity of a and is the 

primitive integral normal vector to a such that > 

return P'"^ . 

Algorithm 1: Ray-Shooting: computing a vertex of a polytope from its normal fan. 



To obtain the multidegree, we only need one vertex. We computed the first vertex using 
Macaulay 2 p5] in a few days. Our ultimate goal was to compute the Newton polytope 
NP(/), a much more difficult computational problem that took us many more months to 
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Figure 3. Ray-shooting and walking algorithms combined. Starting from 
chamber Co we shoot and walk from chamber to chamber. 

complete. As a first attempt, we bound the number of lattice points in the polytope by the 
number of nonnegative lattice points of the given multidegree. Using the software LattE [3], 
we found that the number of monomials in 16 variables with multidegree (110, 55, 55, 55, 55) 
is 5529 528561944. 

By construction, it is clear that the bottleneck of Algorithm [l] is in going through the 
list of 6 865 824 cones. We can modify the algorithm to produce more than one vertex 
for each pass through the list. We do this in two ways. One is to process multiple objective 
vectors at once and save time by reducing the number of file readings and reusing the 
linear algebra computations for checking whether a cone meets a ray or not. Another way 
to produce more vertices is to keep track of the cones that we meet while ray-shooting, and 
use them to walk from chamber to chamber in the normal fan of NP(/). This is described 
in Algorithm [2] On the polytope V, this means walking from vertex p'^-^^i to P"""* for 
scalars t' > t > corresponding to points between three consecutive intersection points, 
along an edge whose i-th. coordinate is negative. If the vector w is generic, then we can 
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Input: A generic objective vector w G M", the vertex , and the set 

S := {{a, i, t) G J" X {1, 2, . . . , n} X M>o : a H {w - M>oei) = {w - tei}}. 
(This input is typically obtained from Algorithm [TJ) 

Output: The set of all vertices of V with objective vectors of the form w — ta for 
some t G M>o and z G {1, 2, . . . , n}. 

for i = 1,2, ... ,n do 

Let (Ti, . . . , am be the cones that intersect the ray w — M>oei transversely. 
Let ti, . . . , G IK>o be such that {ak,i, ifc) G 5 for A; = 1, 2, . . . , m. 
Order ai, . . . , am so that 

h = • • • = < tyfci + l = • • • = ifc2 < ■ ■ ■ < *fci+i = • • • = tm '■= tki+i ■ 

for j = 1,2, ...,/ + 1 do 

i i <r- primitive integral normal vector to a^j with ^^ ■' > Q; 

V ^ V — \ rriCTfe I • (."''^ , where ko = 1, ki^i = m, and nia denotes the 

multiplicity of a. 

Output V, and an objective vector in the line segment between w — tk^Ci and 
|_ - tkj+idi, where tfc,^2 := oo. 

Algorithm 2: Walking: starting from an objective vector and corresponding vertex, 
compute the vertices obtained by changing the objective vector in negative coordinate 
directions. 



assume that aj and ak are parallel whenever they share an intersection point obtained 
by shooting from w m. a, fixed coordinate direction. So we can use any of the cones in a 
parallel class to compute the edge direction of the wall we walk across. By adding up the 
multiplicities of the cones in each class, we get the lattice length of the edge of V. This 
allows us to compute the coordinates of the vertices dual to the chambers we walk into. 
For each of these vertices found by walking from a known vertex, we also get an objective 
vector in the process. For example, any vector of the form w — tei, where t\, <t < t\.j^i-, 
is an objective vector for the j'-th vertex found in the walk in direction — ej. We take 
^m+i ^ numerical stability, we use exact arithmetic over the rational numbers. In 

particular, we always choose the new objective vectors to be integral. 

Using a new vertex, with its associated objective vector, we can repeat the ray-shooting 
(Algorithm [T]) and walking (Algorithm [2]) again. The picture one should have in mind 
is that walking from chamber to chamber in the tropical side corresponds to walks from 
vertex to vertex in NP(/) along edges normal to the codimension one cones traversed in 
the tropical hypersurface. 

The combination of Algorithms 1 and 2 is illustrated in Figures [3] and H) Starting 
from chamber Co and an objective vector wq, we shoot rays in minus the coordinate axes 
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Figure 4. Walking from vertex to vertex in NP(/) C M^. In dash lines, we 
plot the tropical variety. The picture represents the local structure around 
vo. 



directions. The intersection points are indicated by their defining parameters t*- (note that 
superscripts are omitted in the notation of Algorithn:j2]) . As we explain below, to speed up 
the computation of Algorithm [T] we first precompute the inverses of all suitable matrices 
of the form := (— ei|ri| . . . jris) where {ri, . . . ,rio} are generators of the cone a and 
{rii, . . . , ris} span the lineality space in (3.5). Using this, the condition {wq — Xei) R cr 7^ 
translates to the first eleven coordinates of the solution X of X* = (Mq-)"^ • wq being 
positive. Thus, we can easily use the same systems to test (wq + Acj) n a 7^ 0, just changing 
the sign condition for the first coordinate of X. This small modification allows us to walk 
in sixteen new directions (the positive coordinate axes), and find new adjacent vertices to 
vertex vq starting form objective vector wq. The step updating v in Algorithm [2] should be 

v^v + (Efc,_i<fc<fc, ^^k) ■ ^"'^ instead oi v ^ v -{...) . 

In Figure [3j the parameters A associated to the intersection points in these positive 
directions are denoted by t*-. The dashed arrows indicate the shooting directions. The 
points in the cones correspond to intersection points, whereas the points inside chambers 
are the objective vectors obtained for each vertice as described in Algorithm [2j 

The dual walk in the Newton polytope is depicted in Figure |4j We start walking from 
vertex vq and via shooting we obtain the adjacent vertices V5, vj, vs and vio- Notice that by 
this procedure we miss vertices V4,,vq and vg. However, we do get them if we start shooting 
from known adjacent chambers to Cq. For example, vq can be computed if we shoot rays 
from chamber Cs, followed by a shoot from chamber Cq. Observe that this depends heavily 
on the choice of the objective vector wq. 
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4.3. Implementation. A few notes about the implementation of our algorithms are in 
order. As we started working on the problem, we used Macaulay 2 |12j to do the ray- 
shooting (Algorithm [T]) . This script was fine for our first experiments, but it took three 
days to generate a single vertex of the polytope. It soon became evident that something 
faster was needed if we wanted to compute the entire polytope. 

Our first step was to translate the Macaulay 2 script for Algorithm [T] into Python |17j . 
We chose that language because of its fast speed of development and availability of arbi- 
trary precision integers, which were needed by our program. We always scale our objects 
(matrices and vectors) by positive integers so that our objects have integer coefficients. 
This step is crucial for numerical stability. 

This new implementation brought the running time to about 10 hours. This was a 
remarkable improvement, but as the number of vertices of the polytope grew, we realized 
that something even faster was required. Therefore, we decided to resort to caching: instead 
of computing every inverse for each vector, we precomputed all the inverses and stored them 
using a binary format suitable for fast reading in Python (Pickles). This resulted in a ffie 
of a few tens of gigabytes, but dropped the time required for an individual ray-shooting 
procedure down to under three hours. 

Once the Python prototype was working at a reasonable speed, we translated it into 
C++ |21j . which brought the time required to do ray-shooting for a single vertex to 47 
minutes on modest hardware. Moreover, ray-shooting for multiple objective vectors could 
be performed at the same time, thus amortizing the disk reads. Since we still needed large 
integers, we decided to use GMP [lOj and its C++ interface. 

The procedure for walking is a more or less straightforward translation of the pseudocode 
presented in Algorithm[2j It is still implemented in Python, because it takes a short amount 
of time to walk from a few hundred vertices at a time, and the simplicity of the script far 
outweights the time gains a C++ translation would provide. 



4.4. Certifying facets. We now discuss how to certify certain inequalities as facets of a 
polytope V given by the dual tropical hypersurface T{f)- By the duality between tropical 
hypersurfaces and Newton polytopes, each facet direction must be a ray in the tropical 
variety, equipped with the fan structure dual to V. Lemma [4.4| provides a characterization 
for a vector in to be a ray of T{f) with the inherited fan structure. 

Lemma 4.4. Let w £ M" and T{f) he a tropical hypersurface given by a collection of 
cones, but with no prescribed fan structure. Let d be the dimension of its lineality space. 
Let T-L = {(Ti, . . . , CT/} be the list of cones containing w. Let qi be the normal vector to 
cone Gi for i = 1, . . . , / . Then, w is a ray of T{f) if and only if {qi, . . . ,qi} generates a 
{n — d — \)- dimensional vector space if and only if w is a facet direction of NP{f). 

Proof. The vectors {gi, . . . ,qi} are precisely the directions of edges in the face of V := 
NP(/). Since the lineality space of T(/) has dimension d, the polytope V has dimension 
n — d. The face is a facet of V if and only if gi, . . . , g; span a [n — d — l)-dimensional 
vector space. □ 



AN IMPLICITIZATION CHALLENGE FOR BINARY FACTOR ANALYSIS 



21 



For any objective vector w G M", we can compute a vertex in the face by applying 
ray-shooting (Algorithm [T]) to a generic objective vector w' in a chamber of the normal fan 
of V containing w. If we know that w is in fact a facet direction of "P, then any vertex in 
gives us the constant term a in the facet inequality w ■ x < a. This is used in Algorithm [3] 
for checking if a given inequality is a facet inequality of V. This step will be essential to 
certify that our partial list of vertices is indeed the complete list of vertices of the polytope 



V. We discuss this approach in Section 4.5 



Input: An inequality w ■ x < a and a tropical hypersurface (dual to polytope V) 

given as a collection J- of maximal cones. 
Output: True if the inequality is a valid facet inequality of "P; False otherwise. 

for (J G do 
if It; G cj then 

|_ A/" ^ U {normal vector to a}; 

if dim(A/') < n — d — 1 then 
[_ Output False 

else 

w' vector in the interior of a chamber containing w 
Compute the vertex V^' using ray-shooting (Algorithm [l| . 

if w ■ = a then 
|_ Output True 

else 

1^ Output False 

Algorithm 3: Facet certificate: Check if a given inequality defines a facet of a polytope 
given by its normal fan. 



We now explain how to obtain a vector in the interior of a chamber containing a facet 
direction w. We start by applying a modified version of Algorithm [T] with input vector w 
and when we choose to shoot rays only in direction — ei. Since w is a ray of the tropical 
variety given by the collection F, it belongs to some cones {ri, . . . , Tg} in F. Let ai, . . . , am 
be the cones we intersect along the — ei direction (we allow intersections at boundary points 
of each cone). Note that we only pick those cones with ^ 0. 

Now, we use Algorithm [2] with input vector w and the set S corresponding to the cones 
(Ti, . . . ,(Tm and coordinate 1. We assume (cjfc, l,^fc) are ordered in increasing order, with 
all tk > 0. We have two possible scenarios: either 5 is a subset of {0} (that is, either the 
empty set or the set {0}) or it contains a positive real number. In the first case, we pick an 
objective vector wi = w — tei for a positive number t (for numerical stability, we choose 
t to be a big rational number). In the second case, pick a number t between zero and the 
first positive number tj from S and let wi = w — tei. 
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Third, we check if any cone in T contains wx or not. If not, then we let w' = wi. If yes, 
by the balancing condition, this means that there exists a maximal cone in the tropical 
variety containing both wi and w. Note that this cone may be obtained by gluing and/or 
subdividing some cones in J^. In this case then we proceed as above, replacing the original 
input vector w by wi and shooting rays using coordinate 2 instead of coordinate 1. We 
repeat this process with all coordinates if necessary. Unless we have Wi not contained in 
any cone of at step i we are guaranted to have a cone containing w,wi, . . . , Wi-i,Wi by 
construction. By dimensionality argument, at most in sixteen steps, we obtain a vector 
Wi not contained in any cone of J-. This vector will be the objective vector w' from 
Algorithm [3| 

4.5. Completing the polytope. Once the ratio of new vertices computed with ray- 
shooting and walking decreases, the next natural question that arises is how to guarantee 
that we have found all vertices of our polytope. To answer this question, we construct the 
tangent cones at each vertex and try to certify their facets as facets of V. 

Definition 4.5. Let V he a full-dimensional polytope in and v a vertex of V. We 
define the tangent cone ofV at v to be the set: 

TJ' := V + M.>q{w — V : w & = V + M>o(e : e edge of V adjacent to v). 

By construction, is a polyhedron with only one vertex and 'P={^^ vertex of 
particular, an inequality defines a facet of V if and only if it defines a facet of one of the 
tangent cones. 

Let Q be the convex hull of the vertices of V obtained via Algorithms [T] and [2] Our goal 
is to certify that Q = V. We proceed as follows. For each vertex v oi Qwe wish to compare 
the tangent cones and 7^. Since Q has over seventeen million vertices and has no 
symmetry, straightforward convex hull computations are infeasible. If = then the 
extreme rays of would be edge directions of V, which we have already computed as the 
normal directions to the maximal cones of the tropical hypersurface, and which are 15 788 
in total. For a fixed vertex v € Qwe compute all differences w—v for all vertices w of Q and 
test which of these vectors are parallel to edges of V. The number of such edge directions 
in Ty^ is expected to be very small (usually under 30 in practice) . Let C^'^ be the convex 
hull of V and all rays along the edge directions of V in T^. So we have C?''^ C %Q and we 
can test if C^'^ 5 by computing facets of C^'^ with Polymake [llj. If C^'^ 5 T^, 
we use Algorithm [s] to check whether each facet of C^'^ is also a facet of V. In this way, 
we can certify that TJ^ Q C^'^ , hence C^'^ = 7^® = TJ' ■ Certifying this for a vertex v of 
Q in each symmetry class will give us Q = vertex oiQ^v^^ Civ vertex oipV = 'P^ hence 
Q = V. We conclude: 

Lemma 4.6. Let V he a polytope and Q C V be the convex hull of a subset of the vertices 
inV . If all facets of Q are facets ofV, then Q D P, so Q = V . 

If we find that a facet w ■ x < a of C^'^ is not a facet of V from Algorithm [sj then we 
are missing vertices adjacent to f in P in this "false facet direction" tf, so we can perturb 
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w SO that it lies in a chamber of the normal fan of V and use ray-shooting (Algorithm [T]) 
to find a new vertex in that direction. Using this method, we obtained the entire polytope 
in finite number steps. We describe the process of approximating V hy a subpolytope Q 
in Algorithm [4j A schematic of complete tangent cones and incomplete tangent cones is 
depicted in Figure [5} 



Input: A partial list V of vertices of "P, a collection of cones J- whose union is the 
tropical hypersurface, d =dimension of lineality space of the tropical 
hypersurface, and the group of symmetries of the tropical hypersurface. 

Output: A complete list of vertices and facets of V. 

for representatives v of orbits of V do 

Cv ' ^ convex hull of v and all rays in directions w — v where w gV and w — v 

is normal to a cone in J^. 

A ^ facets of Cv ' (using Polymake). 

for z G A do 

if z is a not facet of V hy Algorithm pj then 

a vector in the interior of a chamber whose closure contains z. 
Compute the vertex V^' using ray-shooting (Algorithm 
y ^ y U orbit of V"' 

Break and restart the outermost for-loop with the new V . 
else 

L 5 ^ cSu{z} 
Output vertices V and facets S. 



Algorithm 4: Approximation of "P by a subpolytope Q: Given a partial list of vertices 
of a polytope V with no known complete list of vertices, we construct the subpolytope 
Q generated by this list. We certify when Q equals V. 

In the final stages of the computation, if we find that Cv' is a strict subcone of the 
tangent cone 7^^, we enumerated the rays w — v (with w (z that lie in the difference 
T^\C^'^ . If the number of such rays is small (no more than a few hundreds), we replace 
Cr with the convex hull of Cv ' and those rays (computed using Polymake) and proceed 
as in Algorithm [4j By executing Algorithm [4] in this way, we were able to compute and 
certify all vertices and facets of the polytope. 
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Figure 5. Approximation algorithm. We compute the tangent cones at 
vertices of V and we build a polytope Q dV. We certify if facets of Q are 
also facets of V by Algorithm[3| In the picture, we certify all facet directions 
in containing vertex v but we cannot certify the facet direction w of the 
tangent cone T^- In addition, although we can certify the facet direction 
w' of Q as a true facet direction of 7^, we will not be able to certify the 
constant corresponding to this facet direction of V since we are missing all 
its supporting vertices. The true constant will be obtained using Algo- 
rithm [3l 
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